
Prompt caching with application inference profiles #281

Merged

rstrahan merged 2 commits into develop from fix/enable-cachepoints-for-inference-arns on Apr 14, 2026
Conversation

rstrahan (Contributor) commented Apr 14, 2026

Issue #, if available: #272

  • Prompt caching with application inference profiles — Fixed <<CACHEPOINT>> tags being stripped when using Bedrock application inference profile ARNs as model IDs. The cachepoint check now resolves inference profile ARNs to their underlying foundation model via the GetInferenceProfile API, enabling prompt caching for profiles that wrap supported models (Claude, Nova). Results are cached to avoid repeated API calls, with graceful fallback if the API call fails. A sketch of the resolution logic follows below. (#272)

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
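
A minimal sketch of that resolution path, assuming Python with boto3. The function name, cache shape, and fallback behavior here are illustrative, not the PR's actual code:

```python
import boto3
from botocore.exceptions import ClientError

_bedrock = boto3.client("bedrock")

# Cache of profile ARN -> underlying foundation model ARN (None = unresolved),
# so repeated invocations don't re-call GetInferenceProfile.
_profile_cache = {}

def resolve_underlying_model(model_id):
    """Resolve an application inference profile ARN to the foundation model
    it wraps; pass any other model ID through unchanged."""
    if ":application-inference-profile/" not in model_id:
        return model_id
    if model_id in _profile_cache:
        return _profile_cache[model_id]
    try:
        profile = _bedrock.get_inference_profile(
            inferenceProfileIdentifier=model_id
        )
        models = profile.get("models", [])
        resolved = models[0]["modelArn"] if models else None
    except ClientError:
        # Graceful fallback (assumption): treat an API failure as "unresolved"
        # so the caller can fall back to stripping <<CACHEPOINT>> tags.
        resolved = None
    _profile_cache[model_id] = resolved
    return resolved
```

The cachepoint check can then match the resolved ARN against the supported model families (Claude, Nova) exactly as it would a plain foundation model ID.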

Also added the application-inference-profile/* ARN pattern to the bedrock:InvokeModel IAM policies across all templates (root, appsync, multi-doc-discovery, and sample templates). PR #236 previously fixed only patterns/unified/template.yaml; this completes the fix for all Lambda execution roles. The bedrock:GetInferenceProfile read permission is also added to support the prompt caching resolution above. (#272)
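
A hedged sketch of the resulting policy shape in CloudFormation, assuming the pre-existing foundation-model pattern sits alongside the new one; the actual statements live in the repo's templates:

```yaml
# Illustrative only: resource patterns follow the documented Bedrock ARN
# formats; scoping and statement layout in the real templates may differ.
- Effect: Allow
  Action:
    - bedrock:InvokeModel
  Resource:
    - !Sub "arn:${AWS::Partition}:bedrock:*::foundation-model/*"
    - !Sub "arn:${AWS::Partition}:bedrock:${AWS::Region}:${AWS::AccountId}:application-inference-profile/*"
- Effect: Allow
  Action:
    - bedrock:GetInferenceProfile
  Resource:
    - !Sub "arn:${AWS::Partition}:bedrock:${AWS::Region}:${AWS::AccountId}:application-inference-profile/*"
```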

@rstrahan rstrahan changed the base branch from main to develop April 14, 2026 19:11
@rstrahan rstrahan merged commit a5f0224 into develop Apr 14, 2026
5 checks passed
